DETAILED ACTION

1. This is responsive to the Amendment filed on 26 December 2007.

2. In the Amendment, it is erroneously noted that claims 1, 3-6, 8-11, and 18-20
are pending. In fact, claims 1-7, 10-17, and 20 are pending and are considered
below.

Response to Amendment 

3. The objection to the Drawings is withdrawn in view of Applicant's amendments. 

Response to Arguments 

4. Applicant's arguments filed 26 December 2007 have been fully considered but 
they are not persuasive. 

Applicant argues that there is no teaching in Abe of inverse filtering an audio 
signal with LPC coefficients resulting in a residual signal. However, Abe discloses 
extracting residual energy which, in LPC as is well known in the speech coding art, is 
done by inverse filtering an audio signal with LPC coefficients. In other words, inverse 
filtering an audio signal with LPC coefficients is inherent in residual signal extraction. 
This fact is explicitly and clearly disclosed in Koishida et al (USPN 6,658,383) which 
states that the "LPC coefficients and the music signals ... are then applied to an inverse 
filter 330 that ... generates as output a residual signal". 
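
For illustration only, the relationship described above between LPC coefficients, inverse filtering, and the residual signal can be sketched as follows. This is a minimal sketch and is not taken from Abe, Koishida, or the application: the autocorrelation method with the Levinson-Durbin recursion is assumed as the way of obtaining the coefficients, and the function names (`autocorrelation`, `levinson_durbin`, `inverse_filter`, `energy`) are hypothetical.

```python
def autocorrelation(x, max_lag):
    """Autocorrelation r[0..max_lag] of the sequence x."""
    return [sum(x[n] * x[n - k] for n in range(k, len(x)))
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations.

    Returns the inverse (prediction-error) filter A(z) as
    [1, a1, ..., ap], so that e[n] = x[n] + a1*x[n-1] + ... + ap*x[n-p].
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # signal is perfectly predicted (or silent)
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a


def inverse_filter(x, a):
    """Apply A(z) to x; the output is the residual signal."""
    return [sum(a[j] * x[n - j] for j in range(len(a)) if n - j >= 0)
            for n in range(len(x))]


def energy(x):
    """Sum of squared samples."""
    return sum(v * v for v in x)


# Assumed test signal: a decaying exponential, which a first-order
# predictor models almost exactly.
signal = [0.9 ** n for n in range(64)]
coeffs = levinson_durbin(autocorrelation(signal, 1), 1)
residual = inverse_filter(signal, coeffs)
residual_energy = energy(residual)
```

In this sketch the residual energy is obtained only after the signal has been passed through the inverse filter built from the LPC coefficients; for the assumed test signal it is smaller than the energy of the signal itself.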

Applicant also argues that Abe does not teach "decimating the frame, thereby 
causing the frame to comprise a predetermined number of samples". The Examiner 
respectfully disagrees. Abe discloses in Figure 2 blocks (frames) which can be sampled
at a certain frequency (decimated) causing the blocks to comprise a predetermined 
number of samples related to the frequency ("a total of five hundreds and twelve (512) 
samples obtained ... by using a sampling frequency of 16 kHz", col. 6, lines 57-64). 
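
As a point of reference, the arithmetic behind the quoted passage is that 512 samples at a sampling frequency of 16 kHz span 32 ms. The sketch below works that figure out and, separately, shows decimation in its ordinary signal-processing sense (reducing a frame to fewer samples by an integer factor). It is a minimal illustration, not code from Abe or from the application; the function names and the block-averaging low-pass step are assumptions.

```python
def frame_duration_ms(num_samples, sample_rate_hz):
    """Duration in milliseconds of a frame of num_samples samples."""
    return 1000.0 * num_samples / sample_rate_hz


def decimate(frame, factor):
    """Reduce the sample count by an integer factor.

    Each group of `factor` consecutive samples is averaged (a crude
    low-pass filter) and replaced by one output sample. Trailing samples
    that do not fill a complete group are dropped.
    """
    usable = len(frame) - len(frame) % factor
    return [sum(frame[i:i + factor]) / factor
            for i in range(0, usable, factor)]


duration = frame_duration_ms(512, 16000)    # 512 samples at 16 kHz
shorter = decimate(list(range(512)), 4)     # 512 samples reduced to 128
```

With a fixed factor, the output length is fixed by the input length, which is one way a frame can be made to comprise a predetermined number of samples.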

Claim Rejections - 35 USC § 102 

5. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

6. Claims 1, 3-5, 11, and 13-15 are rejected under 35 U.S.C. 102(e) as being
anticipated by Abe et al (USPN 6,990,443).

Claim 1: 

Abe discloses a method for classifying an audio signal (Abstract), said method 
comprising: 

calculating a plurality of linear prediction coefficients (LPC) for a portion of the 
audio signal (FIG. 3, item 13 and related text); 

inverse filtering the portion of the audio signal with the plurality of linear 
prediction coefficients (LPC), thereby resulting in a residual signal (FIG. 3, item 20 and 
related text); 

measuring the residual energy of the residual signal (FIG. 3, items 20, 36, and
related text); and 

comparing the residual energy to a threshold ("extracting the characteristic
quantity of a signal ... and classifying the signal ... according to the characteristic
quantity thereof", col. 3, lines 43-51).

Claim 3: 

Abe discloses the method of claim 1, wherein the portion of the audio signal
comprises a frame (FIG. 2 and related text).

Claim 4:

Abe discloses the method of claim 3, further comprising: decimating the frame, 
thereby causing the frame to comprise a predetermined number of samples (FIG. 2 and 
related text). 

Claim 5: 

Abe discloses the method of claim 1, further comprising: spectrally flattening the
portion of the audio signal (FIG. 3, item 15 and related text).

Claims 11 and 13-15:

Claims 11 and 13-15 are similar in scope and content to claims 1 and 3-5;
they are rejected with the same rationale.

Claim Rejections - 35 USC § 103 

7. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

8. Claims 2 and 12 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Abe et al (USPN 6,990,443) in view of Koishida et al (USPN 6,658,383). 

Claim 2:

Abe discloses the method of claim 1, but he does not explicitly disclose classifying
the portion of the audio signal as music, if the residual energy exceeds the 
threshold; and classifying the portion of the audio signal as speech, if the threshold 
exceeds the residual energy. 

In a similar audio classification method, Koishida teaches that "linear prediction- 
based techniques such as CELP can deliver high quality reproduction for speech 
signals, but yield unacceptable quality for the reproduction of music signals" (col. 1,
lines 33-37). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to use Abe's residual energy to classify speech and music because, as
Koishida teaches, LPC (linear predictive coding) techniques model speech better than
they do music, and therefore give a smaller error (residual energy) for speech signals
than for music signals.
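
The decision rule recited in claim 2 can be sketched as a single comparison. The sketch below is illustrative only: the function name `classify_portion` and the handling of the case where the residual energy equals the threshold (which the claim language does not address) are assumptions, and no threshold value is taken from the record.

```python
def classify_portion(residual_energy, threshold):
    """Label a portion of an audio signal from its LPC residual energy.

    A large residual means linear prediction modeled the portion poorly
    (labeled music); a small residual means it modeled the portion well
    (labeled speech).
    """
    if residual_energy > threshold:
        return "music"
    if threshold > residual_energy:
        return "speech"
    # Equality is not covered by the claim language; this return value
    # is an assumption made for the sketch.
    return "undetermined"


label_high = classify_portion(residual_energy=5.0, threshold=1.0)
label_low = classify_portion(residual_energy=0.2, threshold=1.0)
```

The numeric values above are arbitrary and serve only to exercise both branches.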

Claim 12: 

Claim 12 is similar in scope and content to claim 2 and is rejected with the same 
rationale. 



Application/Control Number: 10/757,791 Page 6 

Art Unit: 2626 

9. Claims 6, 10, 16, and 20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Abe et al (USPN 6,990,443) in view of Rabiner, L ("Digital Processing 
of Speech Signals", Prentice-Hall, 1978). 

Claim 6:

Abe discloses a method for classifying an audio signal (Abstract), said method 
comprising: 

taking a discrete Fourier transformation of a portion of the audio signal for a 
plurality of frequencies (FIG. 3, item 12 and related text); 

calculating a plurality of linear prediction coefficients (LPC) for the portion of the 
signal (FIG. 3, item 13 and related text); 

In a textbook on speech processing, Rabiner discloses measuring an inverse 
filter response for said plurality of frequencies with said plurality of linear prediction 
coefficients (LPC) (page 433, equation 8.102 and related text); and 

measuring a mean squared error between the discrete Fourier transformation of 
the portion of the audio signal for the plurality of frequencies and the inverse filter 
response (page 433, equation 8.103b and related text); 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to perform Abe's method in the frequency domain in order to "see peaks at the 
formant frequencies" (Rabiner, page 433, lines 3-5). 

Abe further discloses comparing the mean squared error to a threshold
("extracting the characteristic quantity of a signal ... and classifying the signal ...
according to the characteristic quantity thereof", col. 3, lines 43-51); and

wherein the portion of the audio signal comprises a frame (block), decimating
the frame (sampling with 16 kHz), thereby causing the frame to comprise a 
predetermined number of samples (512 samples) (FIG. 2 and related text, see also "a 
total of five hundreds and twelve (512) samples obtained ... by using a sampling 
frequency of 16 kHz", col. 6, lines 57-64). 
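
For illustration, one literal reading of the frequency-domain steps recited in claim 6 can be sketched as follows. This is a minimal sketch under stated assumptions and does not reproduce Rabiner's equations 8.102 or 8.103b: the inverse filter is taken as A(z) = 1 + a1 z^-1 + ... + ap z^-p, its response is evaluated at the same frequencies as the DFT bins, and the error is taken between magnitudes. All function names are hypothetical.

```python
import cmath
import math


def dft(x):
    """Discrete Fourier transform of x, one value per frequency bin."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n)
                for t in range(n))
            for k in range(n)]


def inverse_filter_response(a, num_bins):
    """Response of A(z) = a[0] + a[1] z^-1 + ... at the DFT frequencies."""
    return [sum(a[j] * cmath.exp(-2j * math.pi * k * j / num_bins)
                for j in range(len(a)))
            for k in range(num_bins)]


def spectral_mse(x, a):
    """Mean squared error between |DFT(x)| and |A| over all bins."""
    spectrum = dft(x)
    response = inverse_filter_response(a, len(x))
    return sum((abs(s) - abs(r)) ** 2
               for s, r in zip(spectrum, response)) / len(x)


# A unit impulse has a flat spectrum, which the trivial filter A(z) = 1
# matches exactly; the filter A(z) = 1 - z^-1 does not.
flat_error = spectral_mse([1.0, 0.0, 0.0, 0.0], [1.0])
other_error = spectral_mse([1.0, 0.0, 0.0, 0.0], [1.0, -1.0])
```

The resulting scalar is the quantity that would then be compared to a threshold.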

Claim 10: 

Abe and Rabiner disclose the method of claim 6. Abe further discloses
spectrally flattening the portion of the audio signal (FIG. 3, item 15 and related text).

Claims 16 and 20:

Claims 16 and 20 are similar in scope and content to claims 6 and 10; they are 
rejected with the same rationale. 

10. Claims 7 and 17 are rejected under 35 U.S.C. 103(a) as being unpatentable over
Abe et al (USPN 6,990,443) in view of Rabiner, L ("Digital Processing of Speech
Signals", Prentice-Hall, 1978, pages 433-435) and in further view of Koishida (USPN
6,658,383).

Claim 7:

Abe and Rabiner disclose the method of claim 6, but they do not explicitly
disclose: classifying the portion of the audio signal as music, if the mean squared error 
exceeds the threshold; and classifying the portion of the audio signal as speech, if the 
threshold exceeds the mean squared error.

In a similar audio classification method, Koishida teaches that "linear prediction- 
based techniques such as CELP can deliver high quality reproduction for speech 
signals, but yield unacceptable quality for the reproduction of music signals" (col. 1,
lines 33-37). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to use Abe and Rabiner's error to classify speech and music because, as
Koishida teaches, LPC (linear predictive coding) techniques model speech better than
they do music, and therefore give a smaller error for speech signals than for music
signals.

Claim 17: 

Claim 17 is similar in scope and content to claim 7 and is rejected with the same 
rationale. 

Conclusion 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy set forth in 37 CFR 1.136(a).

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 

Any inquiry concerning this communication or earlier communications from the
examiner should be directed to Samuel G. Neway, whose telephone number is
571-270-1058. The examiner can normally be reached on Monday - Friday 8:30AM - 5:30PM
EST.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's
supervisor, David R Hudspeth, can be reached on 571-272-7843. The fax phone
number for the organization where this application or proceeding is assigned is
571-273-8300.

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/S. G. N./

Examiner, Art Unit 2626 



/David R Hudspeth/ 
Supervisory Patent Examiner, Art Unit 2626 



